Question Analysis Report

Generated: 2025-07-03T21:22:00.704816

Executive Summary

Dataset Size:
9,098 observations
Features:
478 total
Models Analyzed:
10 outcomes
Best R²:
0.820

Model Performance Summary

Outcome Intercept Adj. R² F-statistic F p-value AIC BIC RMSE N Significant Features High VIF Features Mean VIF Max VIF Sample Size
proportion_left_leaning 0.1706 0.1532 0.1489 35.59 0.0000 71268.0 71602.5 12.1238 17 0 1.55 3.74 9,098
proportion_right_leaning 0.3835*** 0.0361 0.0312 7.37 0.0000 40950.9 41285.4 2.2911 20 0 1.55 3.74 9,098
proportion_center_leaning -0.8541 0.8162 0.8152 873.51 0.0000 71645.4 71979.8 12.3779 17 0 1.55 3.74 9,098
proportion_high_quality -3.8070*** 0.8201 0.8192 897.10 0.0000 72149.7 72484.1 12.7257 21 0 1.55 3.74 9,098
proportion_low_quality 0.5097* 0.0610 0.0562 12.78 0.0000 56959.6 57294.0 5.5224 18 0 1.55 3.74 9,098
news_proportion_left_leaning 16.3776*** 0.1197 0.1152 26.75 0.0000 88064.8 88399.3 30.5167 21 0 1.55 3.74 9,098
news_proportion_right_leaning 2.1492*** 0.0579 0.0531 12.10 0.0000 68057.1 68391.5 10.1625 18 0 1.55 3.74 9,098
news_proportion_center_leaning 80.7296*** 0.1413 0.1369 32.37 0.0000 88680.4 89014.9 31.5668 22 0 1.55 3.74 9,098
news_proportion_high_quality 71.0018*** 0.1177 0.1132 26.25 0.0000 90342.2 90676.6 34.5854 23 0 1.55 3.74 9,098
news_proportion_low_quality 4.8523*** 0.0456 0.0408 9.41 0.0000 76387.2 76721.7 16.0628 15 0 1.55 3.74 9,098

Correlation Matrix

Feature Importance

Regression Coefficients by Outcome

proportion_left_leaning (R² = 0.153, 28 features)

proportion_right_leaning (R² = 0.036, 28 features)

proportion_center_leaning (R² = 0.816, 28 features)

proportion_high_quality (R² = 0.820, 28 features)

proportion_low_quality (R² = 0.061, 28 features)

news_proportion_left_leaning (R² = 0.120, 28 features)

news_proportion_right_leaning (R² = 0.058, 28 features)

news_proportion_center_leaning (R² = 0.141, 28 features)

news_proportion_high_quality (R² = 0.118, 28 features)

news_proportion_low_quality (R² = 0.046, 28 features)

Model Family Comparisons

proportion_left_leaning

proportion_right_leaning

proportion_high_quality

proportion_news

num_citations

Multicollinearity Diagnostics

Interpretation: Variance Inflation Factor (VIF) measures multicollinearity.

proportion_left_leaning (High VIF: 0, Mean VIF: 1.55)

proportion_right_leaning (High VIF: 0, Mean VIF: 1.55)

proportion_center_leaning (High VIF: 0, Mean VIF: 1.55)

proportion_high_quality (High VIF: 0, Mean VIF: 1.55)

proportion_low_quality (High VIF: 0, Mean VIF: 1.55)

news_proportion_left_leaning (High VIF: 0, Mean VIF: 1.55)

news_proportion_right_leaning (High VIF: 0, Mean VIF: 1.55)

news_proportion_center_leaning (High VIF: 0, Mean VIF: 1.55)

news_proportion_high_quality (High VIF: 0, Mean VIF: 1.55)

news_proportion_low_quality (High VIF: 0, Mean VIF: 1.55)

Summary Statistics

Variable Type Mean Std Min Max N Missing
num_citations Citation Outcome 5.7652 5.1669 0.0000 46.0000 32,400 0
proportion_high_quality Citation Outcome 8.9662 21.3873 0.0000 100.0000 32,400 0
proportion_left_leaning Citation Outcome 1.6659 7.4564 0.0000 100.0000 32,400 0
proportion_right_leaning Citation Outcome 0.0819 1.2404 0.0000 50.0000 32,400 0
news_proportion_high_quality Citation Outcome 21.8282 39.9889 0.0000 100.0000 32,400 0
news_proportion_left_leaning Citation Outcome 4.7865 18.8207 0.0000 100.0000 32,400 0
news_proportion_right_leaning Citation Outcome 0.3746 5.5664 0.0000 100.0000 32,400 0
proportion_news Citation Outcome 10.7818 23.2094 0.0000 100.0000 32,400 0
turn_number Question/Response Feature 1.7057 2.0636 1.0000 39.0000 32,400 0
total_turns Question/Response Feature 2.5335 3.5807 1.0000 50.0000 32,400 0
question_length_chars_log Question/Response Feature -0.0000 1.0000 -3.8234 2.6358 32,400 0
question_length_words_log Question/Response Feature 0.0000 1.0000 -2.2585 2.9189 32,400 0
response_length_log Question/Response Feature -0.0000 1.0000 -7.0885 3.1220 32,400 0
response_word_count_log Question/Response Feature -0.0000 1.0000 -5.6188 2.9660 32,400 0
model_family_google Model Family 7,563 observations 23.3% - - 32,400 0
model_family_openai Model Family 11,168 observations 34.5% - - 32,400 0
model_family_perplexity Model Family 13,669 observations 42.2% - - 32,400 0

Technical Details

Regression Method: OLS_statsmodels

PCA Precomputed: True

PCA Used: True

Total Features: 47